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Growing network models with preferential attachment dependent on both age and degree are 
proposed to simulate certain features of citation network noted in [1]. In this directed network, a 
new node gets attached to an older node with the probability ~ K(k)f(t) where the degree and 
age of the older node are k and t respectively. Several functional forms of K(k) and f(t) have been 
considered. The desirable features of the citation network can be reproduced with K(k) ~ k^ 13 and 
f(t) ~ exp(«t) with [3 — 2.0 and a — —0.2 and with simple modifications in the growth scheme. 
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I. INTRODUCTION 



The citation patterns of scientific publications form a 
rather complex network. Here the nodes are published 
papers and a link is formed if one paper cites another 
paper published previously. In [2] the citation distribu- 
tion of 783, 339 papers cataloged by Institute of Scientific 
Information (ISI) and also the 24, 296 papers published 
in Physical Review D (PRD) between 1975 and 1994 was 
studied. It was found that the probability P(k) that 
a particular paper is cited k times follows a power law 
distribution P(k) ~ fc -7 with exponent 7 = 3, indicat- 
ing that the incoming degree distribution of the citation 
network is scale-free. Later these studies were extended 
[3] to the outgoing degree distributions as well, and it 
was shown that it has an exponential tail in most cases. 

The citation distribution provides an interesting plat- 
form for theoretical modelling when the various features 
of citation dynamics are taken into account. It must be 
kept in mind that citation is possible only to papers that 
have been published previously, i.e, older papers, so that 
the network is directed. Also since most of the papers 
are gradually forgotten or become irrelevant, the proba- 
bility that a particular paper is cited should decrease in 
time unless it is of utmost importance. Again, a young 
paper, which is undergoing recognition, gains increasing 
attention through citations. Hence the model of a cita- 
tion network should be one in which aging of the papers 
occur such that the probability of a paper getting cited 
depends on its age. Again, from the scale-free nature of 
the degree distribution, it appears that the probability 
of a paper being cited at a given time is proportional to 
its in-degree. 

The distribution of ages of cited papers was studied 
for small sample sizes in [4] and [5] and the results from 
these two studies did not agree. The complete set of cita- 
tions for all publications in Physical Review (PR) jour- 
nals from July 1893 to June 2003 was later studied in 
[1] which perhaps gives the closest possible picture of the 
citation scenario. 



Among the various features of a citation network, those 
which are relevant to the present paper are listed below: 

(i) the distribution T(t) of ages t of citations to other 
publications: this is calculated from the difference of the 
year of publication of a particular paper and the year of 
publication of the papers which are cited by it. 

(ii) the distribution R(t) of citation ages t from citing 
publications calculated from the difference of the year of 
publication of a particular paper and the year of publi- 
cation of the papers citing it. 

Fig. 1 shows pictorially how the two distributions are 
generated. 
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FIG. 1. The two age distributions from a citation network. 
In I, the paper published at time to cites several papers pub- 
lished at different times ti , ti etc. The distribution of the 
intervals (to — ti) gives T(t). In II, the paper published at 
time to is cited by papers published at times ti,t% etc. The 
distribution of the intervals (ti — to) gives R(t). 

(iii) The correlation of the average age of citing pa- 
pers as a function of the degree k of that paper: this is 
denoted by A(k). It is expected that for a paper with 
many citations the average age of the citations will also 
be large such that there is a positive correlation between 
the two. 

For T{t) it was found that in the range of 2 to 15 years, 
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the distribution decays exponentially with time, while for 
longer times the decay is a slower exponential. For R(t), 
over the limited range of 2 to 20 years, the integrated data 
is consistent with a power law decay with an exponent 
~ — 1. Hence, authors tend to have an exponentially de- 
caying memory while citing papers, but the citation age 
distribution to a particular paper has a slower power law 
decay over an initial period of time (lifetime of the pa- 
per) . The PR data showed that there is indeed a positive 
correlation between average citation age and the number 
of times that a paper has been cited (property (hi)) and 
the relation is consistent with a power law. 

In the present paper we have attempted to obtain a 
suitable model for the citation network such that it may 
reproduce some of the main results that were obtained 
from the study of real citation networks. In section II, we 
give a brief review of time dependent networks, where we 
discuss the results of some earlier works. In section III, 
the results for R(t) from the known models are discussed 
and we find that these models are not appropriate for the 
citation network. In section IV, we propose a modified 
model which can reproduce some of the real results to 
an appreciable extent. Finally in section V, we provide 
a summary and also the conclusions of the present work. 



II. BRIEF REVIEW OF MODELS OF AGING 
NETWORKS 

The question of time dependence in the attachment 
probability of the incoming nodes in a growing network 
has been studied in a few theoretical models [4,6,7]. 
These models have basically evolved from the original 
Barabasi- Albert (BA) model [8] where in a growing net- 
work model, a new node gets linked to the existing ones 
following a preferential attachment to nodes with larger 
degree. In the time dependent models, a new node gets 
attached to older nodes with a preferential attachment 
which is dependent on the degree as well as the age of the 
existing node. We discuss briefly below some relevant 
age dependent models and the results thereof. 

In general, in all the models of aging networks, the 
attachment probability TI(/c, t) is taken to be a separable 
function of the degree k and the age t of the existing node 
such that 

n(M) =K(k)f(t). (l) 

In the Dorogovtsev-Mendes (DM) model [6], K(k) = k 
and f(t) = t a were considered. 

In this model the degree distribution was found to be 
scale free for values of a > —1. For a < 0, the age de- 
pendence presents a competing effect to the preferential 
attachment, but for a > 0, the older nodes get richer, 



enhancing the 'ricft gets richer'' effect. 

In [4] an exponential decaying function f(t) = exp(at) 
was chosen and it was found that the model is not scale- 
free for any negative value of a. 

In [7], the DM model was further generalised by in- 
corporating a power law variation of the degree in the 
attachment probability II, 

n(M) ~ k f H a . (2) 

A phase diagram was obtained for this model in the a — (3 
plane, with the phase boundary dividing the phase space 
into the small world and regular network regions. Scale 
free behaviour was found to exist only along a line for 
f3 > 1. In the small world region, there was gel forma- 
tion beyond (3—1, while the degree distribution was 
stretched exponential for [3 < 1, a < — 1. 



III. R(T) FROM STANDARD MODELS 

Evidently a time dependent model would be appropri- 
ate for the citation network. One can immediately realise 
that the time dependent part f(t) of the preferential at- 
tachment probability (1) is analogous to the function T(i) 
defined in section I. The task is to investigate whether as- 
suming an exponential decay in T(t) (i.e., (/(i)) gives us 
the proper behaviour of R(t). 

In our theoretical model, we first take two standard 
forms of time dependence in U(k,t) and look at the be- 
haviour of the corresponding R(t) using a numerical sim- 
ulation. The decay of f(t) is assumed to be (a) power 
law and (b) exponential. The choice of a power law be- 
haviour in the attachment probability may be regarded as 
of theoretical interest mainly as T(t) has been observed 
to have an exponential decay [9]. However, the power law 
model is quite well studied and it may be useful to get 
the results from both models and compare them with the 
real data. We also use a power law dependence of K(k) 
on k. 

The degree distribution has already been studied for 
most of these models. Therefore we are primarily inter- 
ested in calculating R(t), which is related to the degree 
distribution when its average is under consideration. 

In our simulations we have generated networks with 
2000 nodes and 10000 configurations for the power law 
time dependence of the attachment probability, while for 
the exponential time dependence, we have used a maxi- 
mum of 3000 nodes and 5000 configurations. 

Let the zth node born at time r» get R(t, n) links at 
time r. We are interested in the behaviour of R{t, t{) as 
a function of the corresponding age r — Tj = t. It may be 
noted that the cumulative sum 

Rcum(r,Ti) =H t t , =t .R(t' ,Ti) (3) 
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is a well-studied quantity in many networks as a function 
of r and r, and in many network models like the BA or 
DM model it behaves as 



t (T,T t ) =n(T/ n ) 



(4) 



where lZ(x) has a power law growth for large x, e.g. 



K(x) 



(p < 1). In more complicated models, 



e.g., accelerated models [10], R C um(T, Tj) may have a non- 
trivial dependence on both r and r^. In any case, as a 
function of i, R CU m will have a strong T\ dependence. For 
the distribution of the ages of citing papers, we there- 
fore find it more convenient to tag an arbitrary node and 
study the number of links R(t) it gets as a function of 
t suppressing the index r^. The price we pay for this is 
that since there is no averaging there is greater fluctua- 
tion. The node we tag also has to be an early one such 
that data over a long period is obtainable. 



In the following, we detail our findings from the simu- 
lations using two different schemes. 

III. I Scheme(a): In the first scheme, the attachment 
probability is given by H(k,t) ~ kPt a . We have simu- 
lated the network for (3 — 0.5, 1.0 and 2.0 and different 
values of a < 0. Throughout the simulations, we have 
tagged node number 10 (the results do not change if we 
change this number keeping it an early node). The (3=1 
case corresponds to the DM model. From the behaviour 
of R cum (r, 10) here, one can guess that R(t) will have a 
form 
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FIG. 2. R(t) vs t are shown for (3 = 0.5, 1 and 2. In figs 2a 
and 2b, the variations are shown for a = 0, —0.5, —1.0. Here, 
the variation is power law at large values of t only. For f3 = 2, 
variations are shown for a = —0.5 and —1.0. Here however, 
R(t) behaves differently; a power law variation exists for early 
t and dies out very soon to a saturation value (fig 2c). 



This behaviour is observed for large values of t and the 
agreement becomes worse as a becomes more negative. 
We are more interested in the small t behaviour here, 
which turn out to be far from a power law. 



For (3 = 0.5 once again we obtained a similar varia- 
tion of R(t). Power law regions might exist for a = —0.5 
and —1.0 with exponents ~ 0.8, 1.0 respectively. How- 
ever, (3 < 1 may not be a very interesting region as it has 
already been found that there is no scale-free behaviour 
here. 

For (3 = 2 behaviour of R(t) changes: there is ap- 
parently a power law region with exponent ^0.7 during 
early times and later it becomes a constant. The later be- 
haviour is not consistent with the citation results where 
R(t) decays rapidly for large t. These results for the three 
different (3 are shown in Fig. 2. 
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III. 2. Scheme (b): The attachment probability for the 
second scheme is given by 



II(M) ~ k p exp(at). 



(6) 



In [1] and [4] the behaviour of T(t) was found to be ex- 
ponentially decaying. We have therefore taken a model 
with f(t) — exp(ai) with a < 0. We have also gener- 
alised the model of [4] to include a nonlinear functional 
dependence of H(k, t) on k. This is because the f3 = 1 
case showed that there is no scale free region for negative 
a. A scale free region may only be obtained for values of 
(3 > 1 when a < 0. 
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FIG. 3. R(t) vs t data are shown at fi = 0.5, 1,2 respec- 
tively for a — —0.1,-0.2. Power law is not observed here at 
all. 

For the exponential time dependence in n(fc,i), once 
again we study the age distribution of the cita- 



tions to a tagged paper for the values of (3 — 0.5, 1.0, 2.0 
keeping a < 0. 

In this case, power law is not obtained anywhere for 
R(t). For each value of (3, we show in Fig. 3 R(t) for 
a = —0.1 and —0.2 (these values are comparable to the 
observed values). 



IV. A MODIFIED MODEL: R(T) AND OTHER 
RESULTS 



We are in search of a minimal model and find that the 
simple models described in the previous section are not 
sufficient. To add more features, we note that there are 
many differences between these models and a real cita- 
tion network, prominent among which are the following 

(i) In these models, only one paper is being cited by each 
paper 

(ii) In each year, it is being assumed in these models that 
only one paper is being published. (Note that the unit 
of time for the real data had been 1 year). 

Both these are gross simplifications and the real network 
is quite different. 

In order to make the smallest changes, we incorporate 
suitable modifications in the models described in section 
III such that only one of the two factors mentioned is con- 
sidered at a time. This way, it will be also be clear which 
are the indispensible features of the citation network. 

We take the exponential model where the attachment 
probability is given by (6) because we wish to proceed 
with a model in which the time dependent part in the at- 
tachment probability has an exponential decay to mimic 
reality. 

Keeping everything else same, when each new node is 
allowed to have more than one citation (typically 10 or 
20) we find that there is no significant change in the be- 
haviour of R(t). 

Next, again sticking to the exponential model with one 
citation, we consider M number of publications each year 
(M > 1). In the simulation, this means we are putting 
the time label differently, the first M nodes have r, = 1, 
the next M nodes t; = 2 etc. With M — 20, we find that 
the behaviour of R(t) is indeed a power law for t < 20, 
when the value of /3 = 2 and a = —0.1,-0.2 with an 
exponent p = 1.4 ± 0.1. (Fig. 4). Decreasing the value 
of /3, the power law behaviour worsens. There maybe 
some optimum values of (3 and a for which the value of 
the exponent p is closer to the observed 0.94 [1] or some 
more modifications of the basic model maybe required 
to achieve a better quantitative agreement. Our present 
objective is not to obtain precise values but rather to 
obtain the simplest possible model that has an exponen- 
tially decaying f(t) giving a power law decay in R(t). 

Once we have achieved the primary goal, it is impor- 
tant to find out the behaviour of the degree distribution 
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P{k) and the correlation between average age of citations 
A(k) to a paper and its degree k. Here we find that A(k) 
has a power law type increase as has been observed in 
[1] when M = 20 for (3 = 2 and a = -0.2. For M = 1, 
which corresponds to the model described in III. 2, it is 
definitely not a power law (Fig. 6). Hence at least two 
features of the present model are consistent with the ob- 
servations of [1]. 

Lastly, we check the degree distribution. For a few 
initial decades of k, it does give a fairly good agreement 
with a power law decay of the form P(k) ~ /c~ 7 with 
7 = 3. However, there is a increase in P(k) for very large 
k values which indicates a tendency to form a gel (Fig. 
5). In fact, the curvature of P(k) is opposite to that of 
the observed distribution reported in [2,3]. The possible 
reasons for this departure from reality is discussed briefly 
in the next section. 
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FIG. 6. This figure shows the degree distribution P(k) for 
,9 = 2,0= -0.1, -0.2 and M = 20 . The straight line with 
the slope indicated shows the closest fit for k < 100. 
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FIG. 4. R(t) vs t plot with redefined time, i.e, now M nodes 
are born in each year. Here M = 20. A power law behaviour 
is obtained for j3 = 2 at values of a — —0.1 (dashed line) and 
—0.2 (solid line) with exponent p = 1.4 ± 0.1. It is observed 
that as \a\ increases, the power law breaks down at an earlier 
t. 



< 



o 



beta=2, alpha 


=-0.1 




M=1 




M=20 








1 10 


100 


1000 



number of citations (indegree k) 
FIG. 5. Average citation age versus number of citations for 

M = 1, 5, 20, where M is the number of nodes born per time 

step. Here (3 = 2.0 and a = —0.2. As expected, there is a 

positive correlation between A(k) and k, and for larger values 

of M it fits to a power law dependence. 



V. SUMMARY AND CONCLUSION 



We have attempted to construct a simple model for ci- 
tation network in which the evolution rule is formulated 
according to the behaviour of real citation data. 

Since aging is an important factor in citation data, our 
emphasis has been on the age distribution of references 
made by a paper T(t) and made to a paper R(t). The 
interesting observation was that R(t) has a power law 
decay for early t while T(t) has an exponential decay, 
which is rather counter-intuitive. Indeed, the standard 
aging network models fail, but simple modification of the 
exponential model is able to reproduce the correct be- 
haviour of R(t), at least qualitatively. 

It is in general not quite easy to construct a single 
model of citation network which can reproduce all its 
features [3,11]. This may be due to certain distinctive 
features of the citation network of which we mention a 
few below. 

(i) Apart from mathematical quantities like the degree 
and age of a paper, the content of a paper is also impor- 
tant. Evidently a paper on a topic where a large number 
of people work, will get more citations ( that can be quan- 
tified by the impact parameter of a paper) . 

(ii) Neither the number of citations nor the number of 
papers published each year remains constant. 

(iii) In the models, one assumes smooth behaviour, e.g., 
of T(t) while in reality the variations are non-monotonic. 

(iv) There is a possibility of " death" of a papers, or the 
separate existence of dead and live papers as referred to 
in [12]. 

In our modified model, although we have obtained 
good agreement of the behaviour of R(t) and A(k), but 
for P(k) the behaviour does not agree very well with the 
observations. This may be because we have not opti- 
mised the values of /3 and a to get better agreement with 
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the real data and also due to the reasons stated above. [4] H. Zhu, X. Wang and J-Y. Zhu, Phys. Rev. E 68, 

056121(2003). 
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